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(57) Abstract 



Content Lifecycle Management (CLM) is a method for keeping content within an intranet (or internet) up-to-date and relevant for 
users of the website by archiving low-value, unused, inactive, and obsolete content from the site based on a combination of ratings threshold, 
usage threshold, and expiration date. Archiving content removes general access to that content, but retains it in the site for searches specific 
to the archive. CLM uses these threshold values as a method for providing automated feedback and notification to the publisher and/or 
author of aging or poorly performing content. 
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ONLINE CONTENT LIFECYCLE MANAGEMENT BASED ON USAGE STATISTICS, USER-SUPPLIED 
VALUE RATINGS AND EXPIRATION DATES 



Field of the Invention 

The invention pertains to a system for maintaining and accessing a diverse 
collection of information to identify and keep information that is current and useful. 

1 0 Background of the Invention 

As information databases become more prevalent and more complete, a 

j 

number of content management issues must be confronted. In particular, valuable 
data often become "lost in the haystack" of lower-quality content. In addition, 
systems rarely remove stale content or provide meaningful feedback to authors or 
1 5 publishers of the value of their content. It is an object of the present invention to 

address these issues by providing methods for winnowing databases to preserve high- 
quality content while archiving or deleting low-quality or stale content. 

Summary of the Invention 

20 In one aspect, the invention pertains to a system for maintaining currency and 

value for a collection of information. When a piece of information (a content item) is 
\ added to the collection, metadata is added to a database specifying the location of the 

information, and threshold values for usage and utility of the information. When the 
database item is accessed to access the information, the user is allowed to specify a 

25 value rating for the content item, and an access log is updated. The frequency of 

access of the content item and its value can thus be monitored to determine if either 
j has fallen below the associated threshold value. If either usage or value rating falls 

below its threshold, the system performs some action, such as warning the author of 
the information that it is not being used or is received low ratings, marking the item 

30 for archiving, archiving the item, or deleting the item from the system. Content items 

may further have associated expiration dates which are given at the time that the 
content item is added to the collection. If a content item is approaching its expiration 

1 
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date, the system may notify the author or perform another action such as archiving the 
item. The system can be used for monitoring diverse collections of information, 
including such items as URLs and web pages, files stored on a fileserver, whole 
directories and subdirectories, and nondigital data such as books. 

5 

Brief Description of the Drawing 

The invention is described with reference to the several figures of the drawing, 
in which, 

Figure 1 shows a sample screen containing content and a ratings entry area; 

10 and- 

Figure 2 shows the content matrix. 

Detailed Description 

Websites today are faced with daunting content management issues including 

1 5 the proliferation of old and stale content and poor capabilities of providing feedback 

and notification to the author. Content Lifecycle Management (CLM) is designed to 
eliminate this proliferation by keeping content that is displayed on an intranet site 
relevant and up-to-date. This goal is accomplished by tagging each item of content 
recorded in the content management database with an expiration date and threshold 

20 values. Threshold values are minimally acceptable levels in order for content to 

remain current and available. CLM uses at least 2 thresholds: one for the rating (value 
of the content) and one for the usage ( how often each item is viewed.) 

When content is published to the intranet, it is tagged with content thresholds 
for ratings and usage which the author or publisher feels represent minimum values. 

25 When content falls below either value based on a given timed period (for example, 

weekly basis), the CLM will notify the author of this fact, for example by e-mail, 
from within the CLM application, or both. The tabulated values which are required 
for this are collected from users who view and rate the content on the site. Figure 1 is 
a screen shot illustrating how feedback is collected. Each time a user views a 

30 document (or item of content) it is registered in the usage table. When a user rates the 

content using the ratings engine 10 at the top of the window, it is also recorded 

2 
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associating the content ID with the user ID, date, and rating. Rating of content may 
be either optional or required for the user. 

The maximum and minimum values of the two thresholds form the CLM 
matrix, shown in Figure 2. Based on the rating and usage values, each content item 
5 can receive one of four classifications: (1 ) active or good content 20, which means 

that this item of content is receiving high usage and that users find it of high value; 
(2) needs promotion 22, which means that users value the content but usage is low 
(usually because users can not easily find the content); (3) needs updating 24, which 
means that users access the file frequently but that it is not of significant use or value; 
10 and (4) archive content 26, which is the default action for content that is not being 

viewed often and is of low value. 

The ratings and usage of content in combination with its expiration date serve 
as an effective indicator for archiving content. 

The inventive system for maintaining currency of the information dovetails 
1 5 well with systems for targeting information to particular users or groups of users. For 

example, the methods described herein may be used with the methods described in 
U.S. Provisional Application No. 60/129,106, filed April 13, 1999, and U.S. Patent 
Application "Group Targeted Content Personalization," (attorney docket 2001774- 
0001), filed on even date herewith. 

20 

Example 

In one embodiment of the invention, Content Lifecycle Management is based 

on a set of database tables that are part of an overall intranet database. The following 

data are cataloged in the database tables: 
25 • Content information - may include ID, name, file name, content type, 

publish and expiration dates, author/publisher, approval and archive status, and 

ratings/usage thresholds for each content item 

• Ratings - may include content ID, user ID, date, rating, and module in 

which the content was rated 
30 • Usage - may include content ID, user ID, date, and module in which the 

content was viewed 
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• Author/Publisher Info - may include e-mail address for notification by e- 

mail 

The following data are stored on durable media such as a hard disk: 

• Content, which may include documents, files, data, executables, and 
catalogue entries or other references to nondigital content. 

• Archiving Daemon (code) 

• Low-Usage Daemon (code) 

• Low-Rating Daemon (code) 

■ • CLM Matrix Daemon (code) 

Content Publishing 

Content is published to the intranet using the publishing functionality within 
the system. During the publishing process, the file may be uploaded and stored on the 
server and all file-specific and publisher-entered information as well as the 
association between the file and the database entry is saved in the database tables. 
Data specific content such as web links (URLs) are stored in the database and the 
information is associated with the database entry. The user-entered information may 
include items such as the classification of the document using the topic and subtopic 
parameters, access classification (who has access to view the document), target 
classification (who would most benefit from this content), expiration date and the 
rating and usage thresholds for the content. (Access and target classification are 
discussed more fully in U.S. Provisional Application No. 60/129,106, filed April 13, 
1999, and U.S. Patent Application "Group Targeted Content Personalization," 
(attorney docket 2001774-0001), filed on even date herewith). 

Archiving Daemon 

The Archiving Daemon comprises executable code which is run as a 
scheduled task on a periodic basis (set by the administrator). In one preferred 
embodiment, it is set to run on a nightly basis. Each time the archiving daemon is 
run, it makes two comparisons. First, it compares the expiration date of each record in 
the content library with the current date. If the current date matches the expiration 



4 
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date, the record is archived and an e-mail is sent to the author notifying the author that 
the record has been archived. At this point, the content that has been archived is no 
longer available throughout the site unless a search on archived content is executed. 
If the current date represents a period of a week before the expiration date (or 
5 any other suitable time period), the archiving daemon sends an alert e-mail to notify 

the author that the content is aging and will be retired at the end of the week unless it 
is updated. 

Low -Usage & Low-Rating Daemons 

1 0 These daemons are used to notify the author if content the author has 

published falls below the rating and usage thresholds set when the content was 
published. Like the Archiving Daemon, they may be set to run on a periodic basis 
(e.g., nightly). These Daemons may further directly archive content which falls 
below one of the thresholds (e.g., if the author/publisher does not respond to an 

1 5 invitation to improve the content). 

CFM Matrix Daemon 

The Content Lifecycle Management Daemon compares the averaged ratings 
and the usage values of each record in the content management database against the 
20 administration-set values for both the rating and usage. The administration set values 

are the index values and allow the matrix to be configured to a particular group of 
users. If the rating and usage values fall above the index values for both ratings and 
usage, no notification is sent. If either the usage or the ratings values fall below their 
respective index value, a notification is sent. 

25 

Other embodiments of the invention will be apparent to those skilled in the art 
from a consideration of the specification or practice of the invention disclosed herein. 
It is intended that the specification and examples be considered as exemplary only, 
with the true scope and spirit of the invention being indicated by the following claims. 
30 What is claimed is: 



5 
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1 1 . A computer-implemented method of maintaining currency and value of a 

2 collection of information, comprising: 

3 adding an item corresponding to a single piece of information to a database of 

4 metadata pertaining to the collection of information, where the added item includes 

5 an identifier indicating a location for the piece of information; 

6 a minimum value level for the piece of information; and 

7 a minimum usage level for the piece of information; 

8 accessing the added item in the database of metadata, where accessing 

9 includes 

1 0 using the identifier to access the piece of information; 

1 1 updating a record of the actual usage level for the piece of information; 

12 and 

13 allowing updating of a record of the actual value rating for the piece of 

14 information; and 

15 performing an action in response to a condition in which 

16 the actual value rating is below the minimum value level; or 

17 the actual usage level is below the minimum usage level. 
18 

19 2. The method of claim 1 , wherein the item further includes an expiration date 

20 for the piece of information, and wherein an action is performed in response to a 

21 condition in which the actual date is within a selected time period from or is equal to 

22 the expiration date. 
23 

24 3. The method of claim 1 or 2, wherein the action is selected from the group 

25 consisting of: 

26 removing the item from the database of metadata; 

27 marking the item for archiving; 

28 placing the item in an archive database; and 

29 notifying a user that the condition exists. 

30 4. The method of claim 1 , wherein the identifier is selected from the group 

3 1 consisting of uniform resource locators, file location paths, directories, subdirectories, 

32 and catalogue entries for nondigital information. 

6 



xJSDOCID: <WO 0062204A1_t_> 




SOOCID: <WO 0062204A1_I_> 



t 
! 



WO 00/62204 PCT/US00/09767 

2/2 



CD 
CD 
03 
CO 



CO 



CO 

01 







© >.-*-» 

" o 

v. 


Promote 


Update 


<D 
> 

O 
< 

f 



X 



xsi 

ex x 
c o 



s 



CD J 



S 

E 



s 



^9 

or 



MSOOCID: <WO 0062204A1J_> 



INTERNATIONAL SEARCH REPORT 



Int. lonal Application No 

PCT/US 00/09767 



A. CLASSIFICATION OF SUBJECT MATTEB 

IPC 7 G06F17/30 



According to International Patent Classification (IPC) or to both national classification and IPC 



a FIELDS SEARCHED 



Minimum documentation searched 

IPC 7 G06F 



(classification system followed by classification eymbote) 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 

WPI Data, EPO-Internal, PAJ, INSPEC, COMPENDEX, IBM-TDB 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category ' 



Citation of document, with Indication, where appropriate, of the relevant passages 



Relevant to daJm No. 



EP 0 515 073 A (HEWLETT PACKARD CO) 
25 November 1992 (1992-11-25) 
abstract 

column 2, line 23 - line 48 

column 5, line 24 -column 9, line 32; 

figures 2-4 

GB 2 327 787 A (KNOWLEDGE HORIZONS PTY 
LTD) 3 February 1999 (1999-02-03) 
abstract 

page 5, line 21 -page 6, line 18 
page 7, line 30 -page 8, line 9 



1,3,4 



1,2,4 



□ 



Further documents are listed In the continuation of box C. 



Patent family members are listed In annex. 



* Special categories of cited documents : 

"A" document defining the general state of the art which is not 

considered to be of particular relevance 
"E" earlier document but published on or after the international 

fifing date 

*L - document which may throw doubts on priority claim(s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

*0* document referring to an oral disclosure, use, exhibition or 
other means 

*P" document published prior to the International filing date but 
later than the priority date claimed 



T later document published after the International filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

a X* document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"Y* document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

document member of the same patent family 



Date of the actual completion of the international search 



23 August 2000 



Date of mailing of the international search report 



30/08/2000 



Name and mailing address of the ISA 

European Patent Office, P.B. 5616 Patentiaan 2 
NL - 2280 HV ROswi* 
Tel. (431-70) 340-2040. Tx. 31 651 epo rd. 
Fax: (431-70) 340-3016 



Authorized officer 



Polzer, A 



Form PCMS/U210 (sooond atoet) (Jury 1002) 



SDOCIO <WO 0062204A1J_> 



INTERNATIONAL SEARCH REPORT 

Information on patent family member* 



tnl tionai Application No 

PCT/US 00/09767 



Patent document 


Publication 




Patent family 


Publication 


cited tn search report 


date 




members) 


date 


EP 0515073 A 


25-11-1992 


US 


5313631 A 


17-05-1994 




DE 


69224678 0 


16-04-1998 






DE 


69224678 T 


02-07-1998 






OP 


5158770 A 


25-06-1993 



GB 2327787 A 03-02-1999 AU 6905498 A 03-12-1998 



Form PCTASA/210 <p*«rt Umly «n«w) (July 1002) 



^SDOCID: <WO_0062204A1_L> 



